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Descripcion 

GLOBAL BUS SYNCHRONOUS TRANSACTION ACKNOWLEDGE 
WITH NONRESPONSE DETECTION 

5 

TECHNICAL FIELD 

The present invention relates to integrated 
circuit architectures having an on-chip high, speed bus 
with multiple medium speed devices, on or off the chip, 
10 attached co the bus, and in particular relates to command 
or data transfer between devices over the bus and to 
handshaking methods and circuitry for acknowledging re- 
ceipt by a target device of a command or data cackec 
placed on the bus . 

15 

BACKGROUND ART 

In typical bus systems, the bus is at the same 
speed or slower than the. devices attached to it. The 
system bus is locaced on a printed wiring board, with 

2 0 processor ^and memory chip modules being bonded to -the - 
board, and the bus - is subject to ckpacitance and induc- 
tance delays that slow information transfer > over: -the- bus 
becween the various chips. '■■ In -such systems, it is the 
bus rather than the device's on the bus which" are the 

25 primary bottleneck in information transfers,, and calcula- 
tions of latency and bandv/idth, are concerned with' arbi- 
tration delays for obtaining access co the bus. 

When entire systems, "cr significant , port ions 
thereof, are integrated on a chip, the bus itself may 

30 also be integrated onto the chip. Such on-chip buses are 
very fast, tvpically about six to ten times faster than 
those located on printed v;Lrinc boards. An. on-chip bus 
operating at a clock rate ,of 640 to 300 MHz can transfer 
data at a rate of about 4 co 5 GByces/sec. At that speed 
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the bus is so fast it is effectively transparent. The 
bus is significantly faster than even the fastest target 
device attached to the bus. For example, a DRAM has a' 
peak sustainable volume transfer rate of *0.8 GByces/sec. 
5 Even with two- DRAM modul-es, cheir tocal-' bandwidth is only 
1,6 GBytes/sec/ still signir icant ly less than the bus 
bandwidth. -This -means ■ that the speed of the system is ■ 
not limited by the speed of the bus, but by the speed of 
the target devices' on^ the ' bus '.- 

10 - In order -co avoid having one device tie up the 

bus while it waits to ■receive- data requested from, another 
device -on the- bus,- a split transaction bus m^ay be used. 
■In this way, the bus can have many transactions in prog-, 
ress at the sam.e tim.e. Each daca read operation occurs 

15 in two steps: read initiation followed by read coraple- 
tion. There is a delay between read initiation and read 
completion. This delay is the 'time -required for the 
target to' decode the request-, gef~' the requested data and 
send it back to- the request ing • device (miaster) . During 

20 this time, neither che master device nor the- t-arget de- 
vice is on the bus. Rather, after the miaster device has 
sent its data read command in a first- bus cycle-,' ic then 
releases the bus. ■ Thus, while the master device is- wait- 
ing for the completion of its read, the bus can support 

2 5 other transactions. Meanv/hile, the target device oro- 

cesses the received request, and only when the read data 
is ready does it arbitrate for the bus and send zhe re- 
quested data to the mascer device-. The transfer- of the 
daca to the requescing device completes the read cycle . 

30 • One problem that can occur with split transac- 

tion buses is that of a non-existent target device. If 
there is no device to receive a" command, then d-aca does 
not come back.' However, since- split transaction buses 
normally have a delay between" a' -read comm.and and eventual 
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receipt: of data, a nonresponse can go unnoticed. The 
requesting, device continues to- wait -indefinitely . What, 
is needed is a handshaking - method that provides a trans-, 
action acknowledge by the. target device-. It is. desired 
5 that the master- device- get a indication, v/ithin two clock 
cycles of , sending a request ; that _ the; designated target 
device has received that- request^. ..This,.;requi:rement of 
essential.ly immediate feedback is tough- -to -do on a- split 
transaction bus without tying up;, t-he^ bus- :f or.v the. t ime 

10 required to return an acknowleagmento- or al-fcernat iyely 
requiring- the target to arbit-rate.- for . the bus for an- 
acknov;le.dgment cycle separate- from the data .return cycle - 
or cycles . . r 

-In U.S. Patent No. 5.,;6c-o:;,.559 , Wispr^ et. al ; : 

15 describes a.systemi in which periph^al devic^e-s receiving 
data provide an acknowledge, signal -to the- .-central -unit. 
A cime-ou- counter is -pr-ovided, ,^:and- if .the time_-out pe-- 
riod expires, prior to -return of an^ .acknowledge signal, 
che control unit asserts, an error -flag^ and initiates an 

20 interrupt routine..- - • - . ■ 

It is an object of. the present invention to 
provide, a -jsynchronous transact-ion acknowledg.e: circuit- 
with- -nonresponse d;etection for a fast split- transaction 
bus . : . .. . ... 

SuT'Ii^ARY p?- THE INVENITION • - • 

The object is met by providing the bus with a 
separate; transaction acknowledge line, by providing each 
target devi-ce v/ith a. driver -circuit that flips the cur- 
30 rent state of the transaction acknc^vled-ge line- to its 
■opposite state \vhenever the target device receives a 
command intended for it-, and by providing the bus sy^tTrm . 
with an acknowledge detection circuit that looks for 
v/hether the transaction acknovvledge line's state has 
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flipped: This scheme provides' immediate feedback co che 
requesting-- master device that its command has been' re- 
ceived by the designated target device. If the state of 
the transaction acknowledge line remains unchanged, a 
nonexistent target device is vindicated . 



vided to drive the transaction ackncv/ledge line when' no '■ 
other device is ' driving " the ■ bus . In one embodiment, the 
BIDD may include a circuit that detects a nonrespohse 
from a nonexistent target device and which then generates 
a dummy response for the requesting master device. The 
dummy data is flagged to indicate "that' it is not the 
requesced data. Alternatively, detection of- the absence 
of a transaction acknowledge may be carried out by a'-' 
detector in the bus interfaces of every master device. 

BRIE? DSSCRIPTION OF THE DRAWINGS 

Fig. I is a schematic block "diagram -of an inte-— 
grated multi-processor system vjith'a high speed split-' 
transaction bus, in whicri the synchronous 'transaction 
acknowledge ■ with ncnresponse decection of the present 
invention may be located. ' . . 

Fig. 2 is a schematic block diagram of a pro- 
cessing cluster in the system; of Fig. 1, with a global ■ 
bus interface containing the ■ eransacticn acknowledge of 
the present invention. - 

Fig. 3 is a detailed block diagram of the 
global bus -interface 17 '-of- Fig'. 2, showing the transac- 
tion acknowledge generator 79' in the target - interface . 



A bus 'idle default- device (BIDD) may be pro- 



Fig. 4 and 5 are timing diagrams of write and- 



read cransfers, respectively, on the global bus '15 in 
Figs. 1-3, wich "he cransacticn acknowledge (signal 
TA.CK#) indicated as a flip in the signal scate. 



10 



20 



25 



30 



WO 00/60462 W W rCT/USOO/09155 



Fig. 6 and 7 are block circuit: • diagrams of, 
transaction :acknov/ledge - (TACK), generating- and detecting- 
logic-, . respectively. - 

• Fig.- 8 is a-detai-led block diagram of a-bus 

idle default device (EIDD part .oj the global bus -con--, 
trol- unit 13-. of Fig., l, r containing: the, no; TACK detector 
of fig . . 8 . • ' . r ^ . : ■ - - - 

is a timing diagram-illustrating the 
response of the. BIDD of Fig. 3; to a- no, TACK., detection . 



BEST MODE OF -CARRYING OUT. THE INVENTION • • . 

. : r . With reference,, to Fig.-l, an integrated circuit, 
11 forming a multi-processor system has a plurality of • 
processing cluscers 13o - 133 . (here,, f our in number)..; as - 
input /output (I/O) cluster 14, and an SDRAM memory con- 
troller 15, all attached co an. on-chip; high . speed global.' 
bus 16 by means of bus interface units 17. ^ A ty;:ical 
sysEem may have the global bus .15 operate at a 640 MKz 
clock- rate, while the clusters 13 -15 -operate at . a clock 
race which is half that 7 i.e. 320 MHz. -,A global -bus 
control unit IS includes a bus arbiter, regu^^ating access 
to the bus 16 -by the various clusters 13-15, and also 
includes 'a bus -idle . default device, (3IDD) . for use when -.no . 
cluster element. >s driving the, bus.;. The- I/O cluster 14 
and SDRAM controller 15 communicate with off -chip devices 
through an I/O bus 19 and -prog ram.mable I/O subsystem 20 
connecting to I/O- pads 2 1 - of ^ the: chip and to one ;or more 
SDRAJ^l memory chips 22... The present invent ion focuses 
principally on , the global: bus 16, the bus interface unics 
17, and the 5IDD device- in the global bus control unit 
13 . 

Reierring cc ?ig. 2, the integrated circuit's 
bus structure consists of a single global bus 16 and a 
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local bus 29 for each, of che plurality of clusters 13-15 
attached to- the global bus 16 as in Fig. 1. Each pro-, 
cessing -cluster 13 includes a plurality of processing 
functions, such as processing elements, digital signal - 
engines, memory transfer . control engines and associated 
cluster data and instruction memories, caches and regis- 
ters,- all attached.-to the . local bus 23 of the" cluster- 13 . 
I/O clusters (14 in Fig. 1) are similar, except that I/O 
transfer engines i~eplace the digital signal engines and 
memory transfer control ' engines , and an I/O bus (IS in 
Fig, 1.) .also, interfaces with the local bus 29. The buses 
IS and 29 allow r the various- elements on the bus to trans- 
fer information (data, instruc" ions , ■etc.) Bus elements 
consist of two tyr^es : masters 2 5: and targets. 27. Pro- 
cessing elements, digital signal-" engines , memory 'transfer 
control engines and^-I/O transfer engines are examiples- of 
bus mascer devices 25. Memories and registers, including 
cluster data and instruction: memories and cache.s, cluster 
hardware registers for the processing elements ■ digital 
signal engines and memory firansf er -coiitrol engines, as 
well as DRAM -memories - and system-"- registers , are examples 
of bus' target devices .,.2 7.. - All. information transfer is 
between miasters and targets, with the miascers initiating 
transfers to. and from targecs. 'All transfers within a 
cluster 13 are carried out over che local bus 29, while 
informiation transfers betv;een clusters, including- with ■ 
zhe 1/0 cluster (14 in Fig. I) and SDRAiVl controller (15 
in Fig. 1) are carried out over the global bus 16 via 
global bus interfaces 17. The. global bus interface 17 
includes master interfaces 313 with associaced FIFO reg- . 
iscer banks 31A. and targec incerfaces 33B also with asso- 
ciated FIFO register banks 33B. All wrice operations are 
direct transactions from m.aster to target. All read 
operations are split transactions with a comim.and write 
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from master to.cargec to Iniclate the transaction, subse- 
quently, followed by a- separate response write from the 
target back to the originating master to complete the. 
transaction. The global- bus' concrol . (IS in. Fig.. 1) arbi- 
trates among, the master., and target" interfaces 313:and-33B. 
for access to- the global r bus-. 16 and provides ..clocking .for 
data, transfer • between the. master and--: target- FIFOs -"3 lA and 
3 3A. . . ■ • ^ . - r : : . . . : 

- : ■ With .reference to Fig.:;3., the 'global.- bus :inter- • 
face 17, includes a master interface.: 3 1 : and a target .in--" 
terface-33.. The master interface 31 initiates.transf ers 
and the target interface 33 responds cc- transfer requescs 
received from a master., interface.. 3'L. : Most global bus 
interfaces 17 have -both master andr target .■ interfaces 31 
and. 3 3", • although „ some.- devices on the..global bus lS,*such 
as a register bank or. a. memory: could have.. only a.tarQet.. 
interface 33. The bus . system- uses uniform addressing- • 
wich a. single 32-bit address for- all bus elements. Any 
bus master . elemient can address any ocher bus ,taraet ele- ■ 
mient using . the- target element ' s. bus address. - Accord- 
ingly, each global bus master- interface -3 1 has a .unique 
hardware-assigned device, number, -called- "My Device Num- 
ber", .stored in a. register 41..' This-rnumber indicates ;the ; 
unique- interface- 31 that is to receive data in .a global:, 
bus transfer.. It. -is a hardware port numJDer and will 
never be generated by nor . visible to the programmer . - . ; 
Each target . interface 33 also has a range- of global bus 
addresses, called "My- Global Address Range!', that identi- 
fies the addresses to: which .the target will respond." 
This address range is likewise scored in a register 43" in 
the targec interface 33 . . 

The global bus 16 is a single transaction 
write, split transaction read. bus. it is a 64-bit bus, 
w^ith 32-bit addresses and 6H-bit data transfers.. Each 
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bus cycle specifies che transaction type (idle, command, 
data, last 'data) , a bus device to receive the information 
and 64 bits of command or data. Command octets contain 
the -command information (read'/writ-e , etc.) and a 32-bit 
5 transfex address." The destiriation to receive the data- 
(either a target" device recciiv'i'ng-- -a read^ command or a . 
write ccmLmLand plus write data, or a master device receiv- . 
ing data returned by the target device) can either be a 
specific device or a broadcast to all devices (designated 
10 as "device 0"-) . The recommended global bus transfer atom 
eight' 'Words of four by^es each, which results in four 
bus ccteus of eight bytes (64 b"i'ts)' each, with one, tv/o 
and four octet transfers as special cases . A' four, octet: 
data transfer -has a bus efficiency of 80% (one- comm.and - 
15 octet per -four data octets) " All transfers are wrices to 
a FIFO (56, 63, S2, 85 'in Fig. 5) in ■ the- -global bus in- 
cerface 1-7- on the bus 16. Addresses and data are ■ 
T^ipelined. ■ All data transfers on the bus are 64 -bit bus- . 
octet transfers 'with naturally aligned addresses. - Trans- 
20 fers can start at any address. -Data is transferred syn- 
chronous ■■to a bus clock, with the FIFO registers in each 
bus interface device 1-7 functioning to buffer the address' • 
and data information to and -from -^^t he global bus^ 1.6, 
utainly to compensate for clock speed 'differences and ~skew 

2 5 between the data source and destination. The^ FIFO regis- 

ters can add pipeline " delay of up -to 4 clock cycles be- 
tween the source and destination (2 clock cycles az each 

end) . - ' ' 

■ * The global bus 16 has four information transfer 

3 0 types: data write, data read, control write and control 

read. A data write opera" icn by a bus master sends a 
transfer com.mand in a firsc bus cycle, followed by one, 
two or four data octets m che following cycles. The . 
transfer of che last daca oczez completes the wrice cy- 
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cl=. A- data read operation by a bus master- sends the ■ 
transfer- command in thg .first cycle, then releases the 
bus. The- targeted device- receives the command.- When the 
read data- is ready, the target arbicra-tes for -the bus -and 
sends t-he read - data- to. the bus- ma-ster. i:ndica.ced -in the • •■ -. • 
command octet..- The t-r-ansfei?. of the last data- octet to 
the requesting. -master device completes -the xead cycle.- A 
control write, is. an address. ;variant o£ a data write- oper- 
ation .with a- .single data octet: It. -wr.it es data to .a sep.a-. 
rate 32 -bit: concrol address space .-- -The da t.a/ control bit 
in Che command octet indicates the write to -the -control 
address, -space. All t:arg.ecs receive zhe command .and data 
octet-, comple-ting the .cycle.. Control wrices go to a 
separat.e- data, rsg-iste.r in the interfaces that receive- 
15 them..- This.-.is to prevent command reject by interfaces 
busy with -daca operations. Control writes are used to 
send base address.es to each -.cluster, --and t-o send base •• 
addresses and configuration data .to all other global bus 
devices- such as the global registers. ..-Control write is 
also used .to s.end global ..timing signai.s and globa:l .wake- 
up interrupts to -all clusters. jEach cluster receives- a. 
global- bus control wr.ite of its -cluster base address. 
Upon receiving..the iCluster- base -.address, each -cluster 
sends its. base .address to .a 1.1 it- processing -elements and 
digital s.ignal engines, which store this, address so that 
they can respond to transfer requests to.:their internal 
registers when- the appropriate global , address is oresent 
on the cluster data bus. Control read is a counterpart 
to control write. .Control read allows the host or con- 
30 figuring device to read base address and configuration 
-=g-3cers .in the .global b'us control address 
space as v.'eli as write them. This is reauire-d for PCI • 
configuration registers (such as chose visible through a 
PCI interface co external PCI devices.) 



20 



25 
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Each global bus masuer ha^ only one transaction 
in process, at any one. time. It cannot . initiate another, 
transaction until its current transaction is complete. 
Even, though each master can support only one transaction. 
5 at a time, the bus can have many, transactions .in progress 
at a time/ Each read operation occurs in. two steps:- 
read initiation followed by read complete. There is' a 
delay between read initia-tion and read completion. This 
delay is the time requrred for the target to decode the 

10 command,., get the read data and sent it back to the mas- 
ter. During chis .tim.e, .neither the master nor the taroet 
is on the bus. .While' a master is waiting for comicletion- 
of its read; che bus can support other - transactions . For 
example-, other m.asters can. perform write transfers, and 

IS initiate cither read transfers:. - " .. 

Each global bus transacclon begins with a'comi-' 
m.and octet- written, to a target device. A comma.nd octet 
m.ay include the follo^^^ing fields: a" read/write transfer 
bit, a data/control type bit, a two-bit transfer length 

20 field for indicating ■'to- DRA.M memiories che expected trans- 
fer length in octets- (one, two, four-, or greater "han 
four), a two-bit priority field,'-. two mulcibit -fields^ 
(e.g., -six bits each) designating, respectively; the 
device ■ number .of the originating m.aster interface device 

25 for use by- the target device as a destination in respond- 
ing CO read com^mands and the sub-device number designat- 
ing the specific device within a cluster, and a 32 -bit 
address field designating -the target device address and 
address of the data within the target'. Other fields 'mav 

30 be defined or field sizes excended, if desired, . providing 
tne total size or cne command does not exceed che one 
occec sice established by che global bus. 

Referring again zo che interface structure of 
rig. 3 along with the ciming diagram of Fig. 4, a data 
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wrice operation in which a- maste-r device writes 1 :to 4 
data octets to a designated- target device begins with- 
transfer of a 'command from a master device to the master 
interface 31 via the local- bus 29 to che --m.ast ex. -inter face- 
5 bus 47/. and .then: via lines 51 .to the "command, buffer. 53 . . - 
The master interface-' s device number/ -received by the *. ^ 
commandvbuf f er • 53 via lines 5.2 -from the "My Device Num- 
ber" storage, register 41, . is .appended to. the .command-. in 
the appropriace field. Next,, the master interface 31: 

10 requests -access to the global-- bus:, :as seen-.by. the global- 
bus' request line (GBR^^) going low. at reference numeral 
91 in- Fig .-..4. , The. requ.e.st- i-s ma.de -for. e he . command act. et • 
and also fox- each of- the data occets to be written.. In- 
the example oj Fig, 4 , • the master's request, rsignal stays:-' 

15 low for 5 clock cycles for a 5 octet .'transfer . ~ :The 

global;- bus control's arbit.er (18. in Fig. :!)• grants; access 
CO the. master interface for .the .requested number of cy- 
cles, as:, seen by. the' global bus '• request acknowledge or. 
grant line- (G3A#) going low at reference numeral 93 for 

20 five-clock cycles. The m^-aster -interface .3-1.. X-hen sends. 

the write - command octet and : the- data octets .to the global 
bus via the command-.out lines -54 -in rig. 3.,. and via the 
write data .lines, 57; from a write -FIFO regis.ter bank, 5 6 
comm.uni eating .with- the local bus. 29 via interface :bus. 47 

25 and v/rite -data lines 55,.: This issue of the -writ. e- comu.and 
followed by the required number of data octets is _indi- • 
cated by. octets 95-- 9 9 in. F.ig 4:. * . - 

: The write command: octet , is broadcast tQ all 
global bus. target interfaces -(including its own.).., as 

30 indicated at, 100- in Fig. 4 .by target -device code. (TDev); = 
0. It is a broadcast haca'sse the master does not know 
which global bus device will respond to the address con- 
tained in. Che command octet. - The. command octet- contains 
the 32 -bit global address 101 for the transfer as well as 
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the transfer type (write) and transfer- length (1-4 oc- 
tets) . It 'also contains the master's device number. My 
Device Number, but it is not used in v/rite operations. 
Each target -device -33 receives the write command and 
5 write data in the • target-'^-interf ace ' s 'command buffer 7:2 
via command in lines 71 and in 'the target -interface's 
write FIFO register -82 via write data lines 81', respec- 
tively. It compares 'the 32-bit address in the write" 
command, received by- the compare - circuit 95- via the tar- 

10 get address lines "74, against its' own global address. My 
Global Address," received -by the compare circuit 75 via 
lines 76 fromt the storage register "43 . If there is a" ■ 
match, it accepts the v/rite^daca 102-105 and clocks 'it 
out of its write FIFO 82 over lines 33 This ^terminates 

15 the write operation. If there is- a*'"match but the device ■ 
is busy with a -'previous command, it - sends a command re--- 
ject to Ehe bus.- If there is no match, the -target ig- 
nores the com.mand and flushes the v/rite FIFO 82 "'in prepa-- 
ration f or ' the next v/rite commtand. Note that all writes 

20 are broadcast." Normally only the intended target will 

accept * the broadcast write data; the other devices v.'ill ■ 
discard it However , it is^pcssible to- broadcast write 
data to m.ore than one' target if the targets -are designed 
to decode 'a range 'of broadcast addresses:- 

25 We now consider a miaster data read frcmi a tar- 

get with reference to Figs-. 3 and 5. -The mtascer inter- 
face 31 initiates the transfer by- sending a read comimand 
octet to -the global bus after requesting and receiving 
access c6 the bus, as indicated in Fig. 5 at 121, 123 and 

30 125. The read commiand octet ' is ' broadcast (as indicated 
by device 0 at 12 6 in Fig. 5) to all global bus target 
interfaces (including its owni . It is broadcast because 
the master does not know which global bus device will 
respond to the address -(at 127 in Fig. 5) contained in 
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the command occet . The command octet contains the 32 -bit 
global address for transfer • as well as the transfer type 
(write) and ..transfer length' (1-4 octets). It also con- 
tains the. master's device number, My Device Number, which 
the target. device will .use, f orbits -response.. When .the 
master has sent. _-the ; read command octet , it -arms -its read 
FIFO 6-3. -to .receive the read data, over read ..lines .62 at a., 
later cime.- .The -master at - this .point, norm.all.y -stalls and 
waits f.or the .target .to -send .-.the read data, - completing 
the read --command.. . Each -target in-erface 33 . receives - the 
read com.mand octet .over _command.-in . lines ,71 -.into buffer- 
72. Using the compare .circuit 75, ic,compares the-32-bic 
address -in t-he read command against its own , global ad- 
dress, i.My Global Address, scored, in register 43. If . . . 
there. is -a match, che command is transferred .over -lines 
73 and 77 to nhe interface bus-, 5? and thence to the ..local 
bus 29, i.c gets -the _data requested via the local bus 2S, -. 
interface bus -5.7, read lines 84 -and 8S and read FIFO ■ 
regisc-er bank .SS and sends it to .the global bus 16. ■ 
Afcer requesting and obtaining access; to the global bus- 
15, as indicated. ac. 131 and 133.in.Fig, 5, it sends the - 
data 135-138 to . the _.master that .requested the read data.^ 
by using the m.aster'.s device number contained in. the ' - 
command octet as the response address, as indicated by 
use of the master device code 139 in- Fig-.. 5. This term.i- 
nates the read operation. ■ If there.- is a match but the 
device is busy with a previous com-.and, it sends a com- 
mand reject 145 zo the bus 16. If- there is no- match, che 
tar-get ignores the command. Note that the only. valid way 
that data -140-143 can be senc to a waiting read FIFO 63 -. 
in a master is in response to a previously sent read 
com-,ar.d. Only command - octecs contain - che device nur"b-3r 
of Che mascer chat senc che comma.-d, and chis device 
number is hard wired (41) inco the. master . device serd:.-.-.; 
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Che command. The device number is read cn lines 58 by 
compare circuit 60 and checked against the stored device 
number (41) received by the compare circuit 60 over lines 
59. A match enables the FIFO 63 via control line' 61. 
Ther^ is no valid way' that' some other device could send 
data to -an upen master" read- FIFO, causing improper com-, 
pletion of an open' read command. - ■ . > 

Target ■ devices- receive broadcast writes and 
respond to reads. Alternatively, a master device could 
send its write command and data to a specific taraet 
device instead of broadcasting it, if the miaster knew 
which device was to receive the command. You would do 
tnis to- save power, so no other device would receive the 
ccm;mand and dissipate power as a result.' 

In summary, the basic write transfer ssauencB 
is as follows, using a four-octet data transfer as an ' 
example: 

1. The mas cor device requests a 5 octet transfer on the 
bus . 

2. The m;aster issues the carget bus Device number and 
the write command .-^ The target " Device number may be zero 
(broadcast) if the target bus Device number for the v/rite 
is unknown: The write command contains the write ad- 
cress, write command, comm.and priority, chain 'bits and 
rr.aster device code,- ■ 

3. Issue data octetO-2 (Transfer may be 1-4 cctecs 
depending on transfer length code.) 

4. Issue data octet 3 and the Lasc transfer cype, and 
release the bus. Bus arbicration starts again in this 
cv'cle . 
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1. The master device requests a 1 occet transfer for 
the read command. 

2. The master issues .the target^ bus Device number and 
read command. . The target Device.- number may be .zero 
(broadcast) if_the targec bus, Device number for the read- 
is unknown. The read command, contains the read -address , 
read com.mand, command priority , . chain -bits, and master r 
device code. The m.aster device code .will- . the DR.am re- 
sponse address,. 

3 . Release the bus . 

4 . The cargec device requests ..a 4 octet . transfer for -- 
che read data response . 

5. _The target issues the target device, address and the 
first octet of read data . The master device, code is the 
target for the read data. .Transfer m.ay be 1-4 octets 
depending on transfer length code.. 
£. Issue data octet 1-2. 

7. Issue data octet 3, the Lasc transfer type, and 
release the bus. Bus arbitration starts again in this 
cycle . 

In the context of a syscem_ like that , just de-- • 
scribed, using a split trans.action. bus , . the _ presen- in- 
vention provides a transaccion acknowledge ■ (TAGK) signal • 
-o the bus system to indicate receipt of a command, or 
data by at least one cargec device. In particular, the 
carcet device receiving each octet transferred on the 
global bus IS acknowledges the octet by act ivating , a • . • - 
Transfer .Acknowledge (TACK) line of the global bus 16.. 
This is crue for ea.ch octet tra.nsf erred, command, or data.. 
T^K indicates that the carger has received a command 
octet or data octet in-.e.ndcd for i-. .i^s seen in Fig. 3, 
when a target device 3 3 receives a control read .qr wrice 
octet, it decodes ic co see if it .is the intended carce-. 
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using compare circuit 75 . If it* is, it activates TACK 
(by means of a TACK generator circuit 13 providing a TACK 
signal- on lines 80) two clocks after the octet v/as trans- 
ferred as indicated in Figs. '4 and 5 at 106 and 144 for 
5 uhe TACK signal. The- target activates TACK even if^it 

rejects the command (as at- 111 and 145 in. Figs. 4 and 5).- 
If the command Vs/as -.a * write-, each "of the write data octets 
is also acknowledged by the target (at 107-110 in Fig. 
4) . Likewise/ a master' receiving read data activates 

10 TACK for each octef read (at 145-148 in Fig. 5) . TA.CK 
allows you to 'detect when: no device has responded to a 
command, which is a bus error. TACK deteccs this immedi- 
ately, without' having to wait for a bus time out. TA.CK, 
is valuable for debug; it lets you know if any device 

15 responded. '-More'^' than one device can respond with a- TACK 
signal ' without inter-f erence . - - ... " . 

^■TACK' has- unique coding.-. To activate TACK,, you 
change its "state"' from, the previous clock. For continuous 
TA.CK signals--, the TACK line will -flip on each- clock. 

2 0 Each target device activates TACK for each bus clock. 

Note that -.more' 'than- -one -'device can respond with, a TACK 
sigrial^: All'"" responding aevices will drive- TACK -in the 
same direction-. Figs. '6 and 7 block "diagrams of logic to 
generate 'the TACK signal and to - detect -the" TACK signal. ■ 

25 In the 'generator -logic of 'Fig. 6,- the' Last TACK' flip .flop 
151 'records the TACK signal value for the prior cycle. 
The Decode flip flop 153 records a valid "address - decode 
in the previous cycle. If the -^target 'address was. valid 
in the previous cycle, "this logic responds with a -TACK 

30 signal by enabling che TA^CK driver 155. The TACK driver 
155 uses the inverted output of the flip flop 151 to 
generate the current TACK value, which is the complement 
of the previous TACK value. This TACK generator cir- 
cuitry is part of the- target bus interface 33 of each 
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target device or cluster containing target devices on -the 
global bus . ^. ; * . _ - 

'-■-In the- Detector logic of Fig. 7, the -Last TACK 
flip flop 157 records the-:TACK signal value for the prior 
cycle. :if the current TACK signal value is the comple-. 
ment of che TACK signal .value^ in; the prior .cycle, the - 
current' TACK signal is-valid,.-. a^id the. XOR gate 159 out-, 
puts a " t rue TACK -detected- signal --value. --.e The TACK de- 
tector circuitry may; be part of the-,, master .bus interface 
of each master device or clus.ter Af m:as.ter ; device on the 
global bus. Alternatively, a single . TACK- detector can 
form part of . the bus idle default: device. (3IDp) cf-the 
global bus control (13 in Fig.-. 1-),. In: either case, if the 
bus is idle, the BIDD will activ.ace .TACK and drive the... 
bus to" default, levels.- If a command is issued and no,, 
device responds, the TACK line will: not change. .--This is. 
how you; detect -that, you have addressed- a ..non-exisnent 
device:. " If no device drives TACK-, s-uray capacitance •■ and 
bus hold logic will keep the TACK .line .at/, its previous, 
level . : ' i • : r . 

': \ Each, master device on- the ,G3. can have- only one 
outstanding GB transfer in* progress .-at any one^ tim.e. For 
read trans*f.ers, "the G3 master waits for -read data to be 
returne'd-. . For. write .transfers,- the mascer wai-ts for a 
bus ..grant- for .the command and the absence -of a com^m.and 
reject from - the .bus ..indicating . that the write command and 
data have been --accepted . This provides automatic control 
of the -transfer bandwidth between the master(s) and a. 
target. This . is called self throttling. Each master- 
waits for the .target to -respond. The target may have 
received -many GB transfer commands and be in the process 
of servicing them. These commands are typically buffered 
in a command Flr^O. A target may have N com.manGs in its 
FIFO, from N m^asters. Once the com.mands are in the .FIrO, 
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all N devices will wait until' each of them has had its 
command acknowledged. Because each master will wait - 
however long -- for transfer complete,- ■ no target- can be 
overrun. ' . ■ ■ _ . . - . 

5 ■ With reference to" "Figs. 8 and 9, when the' 

global bus -Is idle> no active " device is selected to drive 
the bus. * If -no a:ccive "device -is ■ -selected, the arbiter 
selects a default device, the Bus Idle Default Device 
(BIDD) , to drive the bus . Otherv/ise , the device lines 

10 would float, 'potentially causing noise- and errors. The 
BIDD drives the bus -lines to valid levels by means of 
idle bus logic and bus drivers 161 responsive to an idle 
grant signal -from che arbiter. It sends zeros' for the 
data word, byte enables and device address ; and- zero for 

15 the Word Type: the idle command. • Alternatively, the 

address/data: 'lines are .'held at their previous values (for 
low -power) ; --the byue enables -to inactive; and the target 
device number to all ones.'" It" "also activates the" TACK 
signal at output 153 because it is a valid device, the 

20 BIDD, and Is validly driving che bus. - 'The only time the 
TACK signal .-is- not driven" is when a commiand or- data word 
is sent -on ' the 'bus ' and no device "responds to- -it . 

The BIDD also responds to read comimands" with ^no 
TACK, -through the TACK detector logic I"0-5 (which is that 

25 shown in Fig. ■?) , indicating that -no' device will respond 
CO the read. Global bus master devices can issue read 
and write commands to -non-exiscent target addresses-' '(-de- - 
vices 36, 15*and 27 in the example of Fig. 9) . In this 
case, no 'device v/ill decode the address, restJond to. the 

3 0 command and issue the TACK signal (as indicated by the no 
TACK responses at 173 in Fig. 9) . The G3 mascer that 
issued che comtmtand will be scalled 'waiting for the read 
data unless it notices che lack cf TACK and aborfs cne 
comimand. The next question is how -o abort the co"mB,r:d , 
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The simplest method is to provide, substitute data (175- 
177 in Fig. 9) and let the command to run to normal com- 
pletion with; a flag that notes that -the data is -not - 
valid. This means no special modification to the receiv- 
ing state, machines (and -other- state machines .-that depend 
on -them), - but -requires , inserting , dummy data.,. In order 
for global bus master :to , do :this > it. would have to re- 
quest .the bus (request at 179 and grant at .18a in, Fig.; 9)^ 
and either put the dummy, data on the bus. to- be received 
by itself or send . 1 -4 .bus. idle cycles. ;it- has to do this 
to hold off the global.. bus while inserting the dummy 
data." Otherwise,, the .global .bus could be . trying 'to put., 
data .in- the FIFO while the global bus master logic was 
inserting dummy data. 

• ;■ Fig. S_ -shows^ the m.icro , architecture - for the 
BIDD wi-th no TACK response logic. The -BIDD monitors the 
devi-ce zero broadca.s.t com^mands ..through^ a buffer register; 
lo7 and checks for a -read command. with no TACK . response . 
In the case of a -np^ .TACK response, (at 133 in /Fig. 9) ,* a . - 
state machine^ -169 requests -the global bus and issues, a 
read response., of •l,-,2, or -4. -octets of- , zero -data , as de- 
termined by the 2 •.leng,th--bi-t3; .in the command.- Tt returns 
a -zero da t-a value and zero, byte, enables, with the appro- 
priate word.' -.type codes for read ^response . - The zero byte 
enables „.i-ndicate, that the data is, invalid. (Read data 
normally, returns data-. -with all byte . enables set to ones.) 
The BIDD also- .responds with the -device -address from- the - 
read command (at 1S3 in Fig.- g) so the dummy data goes -to 
the original request ing> device. - The 3IDD uses -a FIFO 171 
to -hold up, to five read requests from the G3 before the- 
BIDD is granted control cf the bus for the No TACK read 
response. 

Fig. S shov/s a timing diacrami for the no TACK 
response. The BIDD has the highest priority when re- 
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questing the GB to minimize command buffering for read 
commands wich no TACK. Command buffering is reauired 
because \it is possible to have several read * commands with 
no TACK occur in ' succession . With the highest- priority, 
only 3 comm.and.s- need to buffered, corresponding t'o the 
nunir.)er of clocks between detection" of the condition- and . 
putting the ' dummry -read data on the G3;. one' to detect the 
condition, one to issue the GB request and one to receive 
the GB granc' . This .timing diagram in Fig, 9 assumes that 
the 3IDD has the highest priority for the GB arbiter and 
also assumes- .that the BIDD can submit a DC'reauest (179) 
as opposed, to '.a. pulsed 'request. The BIDD can hold the- GB 
request for a longer period than needed -because No tack- 
responses .are infrequent. Once the BIDD read' responses 
have .been .issued, the BIDD can fill in with'idle cycles^ 
if the grant ---t-ima "is" Longer 'than needed": -Several read" ' 
commands: to non-existent 'addresses _ could occur in succes- 
sion, m.eaning that the BIDD has to buff er -these ' read 
com.mands . It has to buffer com^mands until it can gain 
access to the GB. By putting the TA.CK non- response logic 
as the highest priority GB device,- this will ^minimize the 
buffering :to the' number of clocks ..'between the time the-. 
TACK was detected 'and 'the time the GB grant is received; 
This should, be 3 commands: one' --t-o detect it, one' to issue 
the request and one to receive the grant. Note that only 
14 bits need be saved from the command word: the 2 bits 
of the length code and 12 bits 

cf the device and sub device a'ddress for the read re-- 
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- . . : Claims .... - • , 

1.. In- an -integrated circuit having multiple circuit* 
devices, a-tached . to an.. on-chip bus, .transaction 
5 acknowledge circuitry . with nonresponse detection for 

indicating- that a command- placed on said ^bus has not been 
received by a designated targec circuit -device , the 
circuitry comprising: ■ - 

a separace transaction acknov;ledge line - ' 
10 provided with said,. bus; and : ■ - 

drive circuit means ^associated with each : taraet. 
circuit device for flipping a current '.stare: o.f said ' 
transaction acknowledge line, to an opposite .state 
whsneyer; a- command designated for a .particular ^'ta^rcet 
15 circuit- device,- is. received by :uhat .de.vice> nonreceipt "of • 
a command by a designated, targex :circuit device ^being . ~ 
indicated by the. stace of . said , transact ion. acknowledge,.- ^ :.■ 
line remaining-unchanged. ■ r: , .: . . . . 

2. -The transaction acknowledge, c-ircuirrv of claim .1,. 
further comprising a bus.. idle :ce fault- .device attached. -to .:. : 
said^.bus, and connected - to. drive .:s.aid' -transac-ta-on • . . 
acknowledge line -to its -.opposite state, .whenever said bus 
25 is idle: 



3. The- transaction acknov/Ledge.- ^circuitry of claim 2 
v/herein said bus idle default device includes means for 
monitoring said transaction acknov/ledge line and 
generating a dummy response v/henever nonreceiot of a 
corrimand is indicated. 
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4. In 'an' integrated circuit architecture having an on- 
chip bus with multiple circuit . devices . attached to the 
buS; whereby commands and data are transferred between 
said circuit devices over the bus, the bus being a split 
transaction bus for~ data ■ read operations,, a. synchronous • 
transaction acknowledge (TACK) system, v/ith nonrcsppnse 
detection circui-try for'^ determining receipt by a 
designated device of a command or data placed on said 
bus, the TACK system comprising: 

a TACK -line associated with said on-chip bus, 
the TACK line having cwo .opposite st-ates; ■ 

bus interface means associated with each 
circuit device for flipping the current .srater of the TACK 
line to its opposite- 'state whenever, a: 'circuit - device 
receives -a command or data incended for that circuit 
device ; . \. -„ ; - 

a bus idle default device (BIDD) attached to 
said bus for flipping the current state of the TACK line 
to its opposite scace whenever said bus is idle; and..'*- ■ 

nonresponse' detection mea;n3 'for monitoring the 
state of said TACK line, --nonreceipt of a command or data 
by a designated circuit device being indicated whenever - 
the state of ■ said' 'TACK line remains "unchanged. 

5. The TACK system of claim 4 wherein .said nonresponse 
detection ■ m.eans includes means for generating dummy data 
in response to- nonreceipt of -a-, comtmiand . and sending said-, 
dum.my data to said circuic device "that originated said 
command, said dummy data indicating said nonreceipt of 
said c omimta nd . 
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6. The TACK system of claim 4 wherein said nonresponse 
detect ion- means is a part of said BIDD. . 

5 7. The TA'CK system of claim:4 v/herein^ said nonresponse 
(^erection means comprised, detection circuits associated^ - 
with each of said circuit devices attached to- said bus-. 

10 8. The TACK system of ciaimi 4 wh-er^ein .said bus 

interface means associated' with each circuit :d8vice; has- - 
means for comparing an- addres.s . field of., any -command 
placed on said bus against an addres.s range to which that 
circuit device • will- respond, -ana whenever there is a 

15 match transferring, said command, to the circuit , device and 
flipping the state of the said TACK line. 

9. The -TACK system o-f .claim. 8 -v/herein - said means ■ for - 
20 flipping the -State of said TACK.line comprises:- 

" ~- a first rlip-fiop having. an input " connected to 
said TACK line "and a inverted output,. : - 

a second flip-flop having an inpu,t ..connected^ to . 
said address compared means and an output, both flip- 
2 5 flops being clocked by a clock for said bus, and 

• ' a tri-stace driver having- an input connected to 
said' inverted output of -said first- flip-flop, an enable 
connected to-said output -of said: second flip-flop, and an 
output connected to said T.ACK line. ■ ^ • - 

50 - - - 
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10. The TACK system of claim 4 wherein said nonresponse 
detection means comprises: ■ . , . 

a flip-flop clocked by a clock for said bus and 
having a input connected to said TACK line, and an 

output, ■. ■ ; 

an exclusive OR gate -with a f ir^t inuut 
connected to said TACK line, 'a second input' connected to 
the output of said flip-flop,:, and an output providing 
said indicative of nonreceipt of a comman or data on 
said bus. 



11. The TACK system of claim 4 wherein 'the .integrated 
circuit architecture forms a multi -processor system with 
some of the circuit devices attached to said on-chip bus 
being- processing clusters, the bus operating at a higher 
clock rate than the clusters.- 
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